Categories

Versions

You are viewing the RapidMiner Studio documentation for version 10.0 - Check here for latest version

Generate n-Grams (Terms) (Text Processing)

Synopsis

Creates term n-Grams of tokens in a document.

Description

This operator creates term n-Grams of tokens in a document. A term n-Gram is defined as a series of consecutive tokens of length n. The term n-Grams generated by this operator consist of all series of consecutive tokens of length n.

Input

  • document

    The document port.

Output

  • document

    The document port.

Parameters

  • max_lengthThe maximal length of the n-Grams. Range: